The Wavelet and Fourier Transforms in Feature Extraction for Text-Dependent, Filterbank-Based Speaker Recognition
نویسندگان
چکیده
An important step in speaker recognition is extracting features from raw speech that captures the unique characteristics of each speaker. The most widely used method of obtaining these features is the filterbank-based Mel Frequency Cepstral Coefficients (MFCC) approach. Typically, an important step in the process is the employment of the discrete Fourier transform (DFT) to compute the spectrum of the speech waveform. However, over the past few years, the discrete wavelet transform (DWT) has gained remarkable attention, and has been favored over the DFT in a wide variety of applications. This work compares the performance of the DFT with the DWT in the computation of MFCC in the feature extraction process for speaker recognition. It is shown that the DWT results in significantly lower order for the Gaussian Mixture Model (GMM) used to model speech and marginal improvement in accuracy. © 2011 Published by Elsevier B.V. "
منابع مشابه
Contourlet-Based Edge Extraction for Image Registration
Image registration is a crucial step in most image processing tasks for which the final result is achieved from a combination of various resources. In general, the majority of registration methods consist of the following four steps: feature extraction, feature matching, transform modeling, and finally image resampling. As the accuracy of a registration process is highly dependent to the fe...
متن کاملText-Dependent Speaker Recognition Using Emotional Features and Neural Networks
This paper deals with a novel feature extraction method for text dependent speaker recognition. Four female speakers were used to create a text –dependent database for Malayalam (one of the south Indian languages). Discrete Wavelet Transform was used for feature extraction and artificial neural network was used for machine intelligence. In this work we used emotional features for speaker recogn...
متن کاملSpeaker Identification Using Discrete Wavelet Transform
Corresponding Author: Shanthini Pandiaraj Department of Electronics and Media Tech., Karunya University, Coimbatore, India Email: [email protected] Abstract: This study presents an experimental evaluation of Discrete Wavelet Transforms for use in speaker identification. The features are tested using speech data provided by the CHAINS corpus. This system consists of two stages: Feature extract...
متن کاملA Fast Localization and Feature Extraction Method Based on Wavelet Transform in Iris Recognition
With an increasing emphasis on security, automated personal identification based on biometrics has been receiving extensive attention. Iris recognition, as an emerging biometric recognition approach, is becoming a very active topic in both research and practical applications. In general, a typical iris recognition system includes iris imaging, iris liveness detection, and recognition. This rese...
متن کاملWavelet LPC With Neural Network for Speaker Identification System
In this study, an average framing linear prediction coding (AFLPC) technique for text-independent speaker identification systems is proposed.The study of the combination of modified LPC with wavelet transform (WT), termed AFLPC, is presented for speaker identification based on our previous paper. The study procedure is based on feature extraction and voice classification. In the phase of classi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2011